Generalized Bayes Quantification Learning under Dataset Shift

نویسندگان

چکیده

Quantification learning is the task of prevalence estimation for a test population using predictions from classifier trained on different population. methods assume that sensitivities and specificities are either perfect or transportable training to These assumptions inappropriate in presence dataset shift, when misclassification rates not representative those under shift has been addressed only single-class (categorical) assuming knowledge true labels small subset We propose generalized Bayes quantification (GBQL) uses entire compositional probabilistic classifiers allows uncertainty class limited labeled data. Instead positing full model, we use model-free Bayesian estimating equation approach data Kullback–Leibler loss-functions based first-moment assumption. The idea will be useful analysis general as it robust generating mechanisms 0’s 1’s outputs thereby including categorical special case. show how our method yields existing approaches cases. Extension an ensemble GBQL multiple yielding inference inclusion poor discussed. outline fast efficient Gibbs sampler rounding coarsening approximation loss functions. establish posterior consistency, asymptotic normality valid coverage interval estimates GBQL, which first theoretical results local also finite sample concentration rate. Empirical performance demonstrated through simulations real with evident shift. Supplementary materials this article available online.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust supervised learning under uncertainty in dataset shift

When machine learning is deployed in the real world, its performance can be significantly undermined because test data may follow a different distribution from training data. To build a reliable machine learning system in such a scenario, we propose a supervised learning framework that is explicitly robust to the uncertainty of dataset shift. Our robust learning framework is flexible in modelin...

متن کامل

Dataset Shift Adaptive Computation and Machine Learning

All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission in writing from this is the last candidate. next esc will revert to uncompleted text. he publisher. Overview Dataset shift is a challenging situation where the joint distribution of inputs and o...

متن کامل

Discriminative Learning Under Covariate Shift

We address classification problems for which the training instances are governed by an input distribution that is allowed to differ arbitrarily from the test distribution—problems also referred to as classification under covariate shift. We derive a solution that is purely discriminative: neither training nor test distribution are modeled explicitly. The problem of learning under covariate shif...

متن کامل

Active Transfer Learning under Model Shift

Transfer learning algorithms are used when one has sufficient training data for one supervised learning task (the source task) but only very limited training data for a second task (the target task) that is similar but not identical to the first. These algorithms use varying assumptions about the similarity between the tasks to carry information from the source to the target task. Common assump...

متن کامل

Robust Supervised Learning under Distribution Shift Uncertainty

Distributionally Robust Supervised Learning (DRSL) is necessary for building reliable machine learning systems. When machine learning is deployed in the real world, its performance can be significantly degraded because test data may follow a different distribution from training data. Previous DRSL minimizes the loss for the worst-case test distribution. However, our theoretical analyses show th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of the American Statistical Association

سال: 2021

ISSN: ['0162-1459', '1537-274X', '2326-6228', '1522-5445']

DOI: https://doi.org/10.1080/01621459.2021.1909599